AN APPROACH TO CONTAMINATION IDENTIFICATION

SUMMARY 


A regression analysis method has been applied to unfold the waveforms 
resulting from complex gaseous mixtures in mass spectral analysis. The 
output format specifies both qualitative and quantitative chemical analysis. 

This method of data analysis is applied towards optical contamination identi- 
fication in vacuum systems. A number of test mixtures are presented to demon- 
strate the feasibility of this approach. In particular, a rather difficult test 
case of normal butane, isobutane, and 2-cis butane is successfully analyzed. 

The basic theory of analysis is discussed and the limitations inherent 
in regression calculations on a small computer system are indicated in a 
number of theoretical problems.

The equipment used allowed this effort to be considered only as a
feasibility study, and the results indicate that analysis with
research-grade instrumentation is most promising.


INTRODUCTION 


Optical contamination has been arbitrarily divided into two broad 
categories: particulate effects and gaseous effects. The following discussion 
concerns gaseous contamination. 

Experiments dealing with the detrimental effects of gaseous contamina- 
tion quite often are accompanied by spectra resulting from either a residual 
gas analyzer or a mass spectrometer. 

These spectra were difficult to analyze (i.e., to determine their meaning
or effect on the optical experiment). This lack of interpretation provided the motivation
to determine what information could be actually derived from a mass scan and 
how such information might aid in eliminating those gaseous compounds that 
cause detrimental optical effects. 



Additional incentive was supplied by the flight experiment designated 
T-030, which represents a flight-type quadrupole mass spectrometer with a 
range from 0 to 100 amu. This experiment has been suggested as a possible 
in situ contamination monitor to be flown in conjunction with the optical con- 
tamination experiment on board an Apollo Applications Program (AAP) mission. 


STATEMENT OF THE PROBLEM 


The problem of obtaining information from mass spectra was then 
formulated into an attempt to answer the basic question of quantitative analysis. 
Quantitative mass spectral analysis presents the following areas of question: 

1. Qualitative — Chemical identification of compounds in an unknown 
spectrum 

2. Quantitative analysis 

a. Relative quantitative analysis — Percent composition of each 
of the above compounds 

b. Absolute quantitative analysis — Partial pressure, mass or 
number of atoms of each of the above compounds 

3. Residuals — Percent of the mixture that remains unknown after 
the above analysis 


APPROACH TO SOLUTION OF PROBLEM 


After a study of a number of texts [1, 2, 3] on mass analysis and a
thorough review and exercise of mass interpretation by Wadsworth and 
Tunnicliff [4], a computerized multiple regression approach was adopted that 
was considered to be consistent with equipment available to this project. 
JUSTIFICATION


The four basic assumptions in quantitative mass spectral analysis, first 
developed and verified by Washburn, Wiley, and Rock [5] around 1943 to 1945, 
are as follows:

1. Most chemical compounds in nature have their own unique set of
pattern coefficients (cracking patterns). Pattern coefficients result from a
normalization of the intensity at the allowed specific mass numbers for a
compound (Table 1). Pattern coefficients have this unique behavior because
of the nature of chemical valences and bondings existing between atoms
(Table 2).

2. Pattern coefficients are stable under controlled experimental
conditions. This is evident from measurements of the same compound made by
five different laboratories using different types of spectrometers
(Table 3). The degree of dissimilarity that remains indicates the
desirability of calibration data for a particular instrument over tabulated
data in compilations such as ASTM [6] or Cornu and Massot [7].

3. The measured intensities of the ion beams of the various components 
in a mixture are proportional to the partial pressure of the respective compo- 
nents. This is because the output intensity is designed to be a linear function 
of the number of ions collected, and the ideal gas law is assumed valid in the 
pressure region where mass spectrometers work. Since pressure is the 
number of ions per unit volume collected multiplied by the product of Boltzmann's 
constant and the absolute temperature of the ions, mass spectrometers are 
partial pressure gauges. 

4. Mixture spectra obey the principle of linear superposition. This is 
true because mass spectrometers are partial pressure meters, and Dalton's 
Law of Partial Pressure is well obeyed below 10^-4 torr.

Dalton's law states that if several gases are present in a chamber
simultaneously, the total pressure of the mixture is the sum of the pressures
that each gas would exert if it were present alone. Applied to each peak in
a mixture spectrum, this principle indicates that the total intensity at a
specific mass value is the sum of the intensities that would be measured if
each gas were present separately.
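
Assumptions 3 and 4 can be written compactly as a single linear relation. The
notation below is introduced here for illustration only and does not appear in
the report: I_m is the measured intensity at mass number m, a_{mj} is the
pattern coefficient of compound j at mass m, x_j is the contribution of
compound j (proportional to its partial pressure p_j), n_j is its number
density, k is Boltzmann's constant, and T is the absolute temperature.

```latex
p_j = n_j \, k \, T ,
\qquad
I_m = \sum_{j=1}^{N} a_{mj} \, x_j ,
\qquad m = 1, \dots, M , \quad x_j \propto p_j
```

With M peaks and N candidate compounds, this gives M equations in N unknowns.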
DISCUSSION


A mixture spectrum at one specific mass value can mathematically be
represented as a hyperplane. Taken over all specific mass values occurring
in a mixture, the problem becomes one of a set of simultaneous linear
nonhomogeneous equations. In most cases, there are more equations than
unknowns, the unknowns being the percentage composition of the various
compounds in the mixture. The solution to such equations can be well
approximated by the least squares technique of multiple linear regression [8].
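
A minimal sketch of such a least squares fit is given below, assuming the
normal equations are formed explicitly and solved by Gauss-Jordan
elimination. The function names, the use of partial pivoting, and the
pattern coefficients in the example are illustrative assumptions; the
report's own program is not reproduced here.

```python
def solve_gauss_jordan(a, b):
    """Solve the square system a x = b by Gauss-Jordan elimination.

    Partial pivoting is an assumption of this sketch.
    """
    n = len(b)
    # Augmented matrix [a | b], copied as floats.
    aug = [[float(v) for v in a[i]] + [float(b[i])] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular matrix (duplicate library spectra?)")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[col])]
    return [aug[i][n] for i in range(n)]


def least_squares(patterns, spectrum):
    """Fit spectrum[m] ~ sum_j x[j] * patterns[j][m] and return x."""
    n = len(patterns)
    # Normal equations: (A^T A) x = A^T y.
    ata = [[sum(u * v for u, v in zip(patterns[i], patterns[k]))
            for k in range(n)] for i in range(n)]
    aty = [sum(u * y for u, y in zip(patterns[i], spectrum))
           for i in range(n)]
    return solve_gauss_jordan(ata, aty)


# Two invented compounds on four mass numbers (more equations than unknowns).
compound_a = [100.0, 50.0, 0.0, 10.0]
compound_b = [0.0, 30.0, 100.0, 20.0]
mixture = [0.25 * u + 0.75 * v for u, v in zip(compound_a, compound_b)]
fit = least_squares([compound_a, compound_b], mixture)
```

For this noise-free example the fit recovers the fractions 0.25 and 0.75 to
within round-off.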

Library Data 


In attempting to analyze an unknown spectrum, a large library containing 
the pattern coefficients of pure compounds is essential. The compounds in this 
library are compared against the unknown. A number of logical decisions and
regression calculations are performed to determine whether a particular
compound or group of compounds in the library belongs in the unknown mixture
spectrum.
This procedure is carried out until a calculated spectrum agrees favorably 
with the experimentally measured spectrum. 

The ASTM Committee E-14 [6] and Cornu and Massot [7] libraries 
represent sources of vast quantities of mass spectral data. Together they 
contain approximately 8000 spectra. There are many duplicate spectra
in these libraries. The duplications result from different laboratories
contributing spectra of similar compounds. Such duplications in a library
represent singularities in the mathematical analysis and were subsequently
eliminated by a program designed to make these libraries unique.
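
The report does not say how duplicates were detected. One possible
criterion, sketched below purely as an assumption, is to drop any spectrum
that is nearly collinear with one already kept, since collinear columns are
what make the normal equations singular. The cosine-similarity test, the
0.999 threshold, and the sample entries are all hypothetical.

```python
import math


def cosine(p, q):
    """Cosine similarity of two spectra given as {mass: coefficient}."""
    masses = set(p) | set(q)
    dot = sum(p.get(m, 0.0) * q.get(m, 0.0) for m in masses)
    norm = (math.sqrt(sum(v * v for v in p.values()))
            * math.sqrt(sum(v * v for v in q.values())))
    return dot / norm if norm else 0.0


def unique_library(library, threshold=0.999):
    """Keep the first of any group of near-identical spectra.

    library: list of (name, {mass: coefficient}) pairs.
    threshold: assumed similarity above which two entries count as duplicates.
    """
    kept = []
    for name, peaks in library:
        if all(cosine(peaks, other) < threshold for _, other in kept):
            kept.append((name, peaks))
    return kept


# Invented entries: the second is a duplicate of the first.
library = [
    ("sample 1", {29: 40.0, 43: 100.0}),
    ("sample 1, second source", {29: 40.0, 43: 100.0}),
    ("sample 2", {28: 10.0, 44: 100.0}),
]
unique = unique_library(library)
```

A looser threshold would also merge spectra of the same compound measured on
different instruments, at the risk of merging genuinely different compounds.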

Cornu and Massot data give the 10 most intense peaks per sample,
compared with 6 peaks per sample in the ASTM data. More peaks per sample tend
to make a compound easier to identify by this technique. As indicated
previously, however, actual calibration data for a particular instrument
represent the best type of library data. The library of mass spectral data
used in the ensuing analysis ranged from 100 to 1000 samples and represents a
combination of in-house calibration data with the above two sources. This
library of data is stored on a magnetic disk of an IBM 1130 computer system.
In an attempt to establish the validity of the logical decisions for
accepting or rejecting a library compound as existing in an unknown spectrum
and simultaneously determine the effect of the regression calculations, a
program for synthesizing mass spectra of mixtures was developed. A synthetic
mixture is formed by multiplying the pattern coefficients of one sample
by an appropriate percentage factor and adding the result to the pattern
coefficients of another sample, also multiplied by a percentage factor. In
this way, 5 percent of one compound is added in the computer to 10 percent
of another until 100 percent of a combination is obtained. The resulting
spectrum is normalized and presented to the computer as an unknown.
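
The synthesis step can be sketched as follows. The function name, the
dictionary layout, and the choice to normalize to a base peak of 100 (the
convention seen in Table 3) are assumptions made for illustration, and the
pattern coefficients are invented.

```python
def synthesize(components):
    """Build a synthetic mixture spectrum.

    components: list of (fraction, {mass: pattern_coefficient}) pairs whose
    fractions sum to 1. Returns {mass: intensity} scaled so that the largest
    peak equals 100 (assumed normalization).
    """
    mixture = {}
    for fraction, peaks in components:
        for mass, value in peaks.items():
            mixture[mass] = mixture.get(mass, 0.0) + fraction * value
    base = max(mixture.values())
    return {mass: 100.0 * value / base
            for mass, value in sorted(mixture.items())}


# Two invented compounds mixed in equal proportion.
first = {12: 10.0, 28: 100.0}
second = {28: 50.0, 44: 100.0}
unknown = synthesize([(0.5, first), (0.5, second)])
```

Here the shared peak at mass 28 becomes the base peak of the mixture, which
illustrates how overlapping spectra reshape the normalized pattern.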


Theoretical Results 


This exercise indicated the theoretical limitations of the technique on
an IBM 1130 system (8-k core, 0.5-million-word disk storage; Table 4).
Test mixture 5 represents five compounds mixed together in equal proportion.
The mass range goes from 12 to 44 and contains 17 peaks. The computer
program can handle up to 125 peaks. The analysis is both qualitatively and
quantitatively correct. Each compound contributes 20 percent to the mixture
(10 microtorr). The input, calculated, and residual spectra indicate how the
calculated pattern coefficients compare to the input pattern coefficients. The
residuals represent the difference between input and calculated data. If the
residual contribution is above 1 percent, the residual spectrum is represented
as pattern coefficients. The serial numbers associated with the compounds
are for purposes of computer organization and identification on the magnetic
disk.


A synthetic test mixture of eight compounds in equal proportion with
28 peaks ranging from 12 to 46 amu represented the largest mixture the
IBM 1130 could handle. The qualitative analysis was correct, but round-off
effects resulted in a 0.5-percent error in the quantitative mole percentages.
This round-off error results from using the Gauss-Jordan method to solve the
normal equations that are at the heart of the regression calculations. Eight
compounds in a mixture represent the maximum number that can be properly
analyzed at present. To give these theoretical data a more experimental
character, deviations were added to the mixture spectra before the
normalization. Another test mixture of eight compounds in equal proportion
was synthesized, and a ±3-percent deviation was added to the intensities. The
mass values range from 12 to 75 (Table 5), with 28 peaks. The qualitative
analysis is almost correct. A contribution of 1.05 percent butadiene appeared
in the analysis when none was actually present in the mixture. The eight
chemicals actually present were properly chosen. The quantitative
calculations seem reasonably accurate considering the large deviations applied
to the spectrum and the round-off error in the computer. The residuals are a
further effect of the large deviations. A 1-percent deviation applied to the
same data and a 3-percent deviation applied to a mixture of seven compounds
each produced a correct analysis with very little change in the mole
percentages.
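
The report does not state how the deviations were generated. The sketch
below assumes each intensity is scaled independently by a factor drawn
uniformly from the stated band; the function name, the uniform distribution,
the seed, and the sample spectrum are all illustrative assumptions.

```python
import random


def perturb(spectrum, percent, rng):
    """Scale each intensity by a random factor within +/- percent.

    spectrum: {mass: intensity} before normalization.
    rng: a random.Random instance, passed in so runs are repeatable.
    """
    limit = percent / 100.0
    return {mass: value * (1.0 + rng.uniform(-limit, limit))
            for mass, value in spectrum.items()}


# Invented un-normalized mixture spectrum with a 3-percent deviation applied.
clean = {12: 5.0, 28: 75.0, 44: 50.0}
noisy = perturb(clean, 3.0, random.Random(0))
```

The perturbed spectrum would then be normalized and analyzed in the same way
as the clean one, so the two sets of mole percentages can be compared.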

The residual calculation gives an indication of how useful the ASTM and
Cornu and Massot libraries are. Using pattern coefficients from these sources
will produce contributions to the residuals and possible inaccuracy in the
qualitative determination. However, allowing for such errors, the data in
these tabulations save many in-house hours of calibration and can produce a
reasonable analysis.

To further appreciate the significance of the residual calculations,
another test was performed (Table 6), in which a mixture of five compounds
in equal proportion was synthesized and hexadiene, Serial No. 53, was
eliminated from the library data. The contribution to residuals should then
be due entirely to the missing compound. The mole percentage for residuals
was properly calculated at 20 percent, and the residual spectrum is an
excellent representation of the missing compound. Residuals therefore reflect
differences in calibration data, compounds not included in the library, and
round-off effects in the computer calculations.
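
A minimal sketch of a residual calculation follows. The report does not
define how the residual percentage is computed; here it is assumed to be the
unexplained share of the total input intensity. The function name and all
numbers are invented, with the missing compound placed on a peak of its own
so the effect is easy to follow.

```python
def residual_report(patterns, contributions, spectrum):
    """Compare an input spectrum with the spectrum rebuilt from a fit.

    patterns[j][m]: pattern coefficient of fitted compound j at mass index m.
    contributions[j]: fitted contribution of compound j.
    spectrum[m]: input intensity at mass index m.
    Returns (calculated, residual, percent), where percent is the assumed
    residual share of the total input intensity.
    """
    calculated = [sum(x * p[m] for x, p in zip(contributions, patterns))
                  for m in range(len(spectrum))]
    residual = [y - c for y, c in zip(spectrum, calculated)]
    percent = 100.0 * sum(abs(r) for r in residual) / sum(spectrum)
    return calculated, residual, percent


# One library compound explains the first two peaks; the third peak comes
# from a compound that is absent from the library.
library_pattern = [100.0, 20.0, 0.0]
measured = [80.0, 16.0, 24.0]
calculated, residual, percent = residual_report(
    [library_pattern], [0.8], measured)
```

In this example the residual spectrum is nonzero only at the third peak, and
the residual share is 20 percent of the total intensity.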

The most promising aspect of this effort lies in the analysis of
experimental test cases. By definition, an experimental test case represents
a mixture concocted in the laboratory where mole percentages can be accurately
determined.


Experimental Equipment 


All test cases were measured on a Consolidated Electrodynamics
Corporation Model 612 Residual Gas Analyzer. This cycloid tube [8] is not
designed for analytical work, but represents a good experimental test of this
technique. The instrument is specified as unit resolution at mass 44 based
on a slit width of 5 mils defining the collector. The spectrum shape
indicated a slit width of approximately 50 mils, which suggests that the
spot-welded slits are missing; this reduces the unit resolution to mass 20.
The reduced resolution limits the working range of the instrument to
an upper limit of approximately 60 mass units (amu). The lower limit is 12,
the low mass range being too unreliable for quantitative work.

Test mixtures were prepared in a glass inlet system. Quantitative 
mole percentages were difficult to determine because of the lack of absolute 
pressure calibration for the various gases used. Since this is only a feasibility 
study, all relative ionization probabilities for the various gases as measured 
by a Bayard-Alpert gauge were assumed to be unity. Though this assumption
is not too unreasonable for a number of gases, an undetermined error in 
estimated mole percentages does exist. The ion gauge was factory-calibrated 
only for nitrogen. All measured mole percentages are only approximate. The 
purpose of these experimental test mixtures is merely to indicate the feasibility 
of this approach. 

Data were abstracted from the equipment by a two-channel
analog-to-digital-to-paper-tape-to-punched-card system. As previously
indicated, the entire effort is oriented around a small computer system. The
computer generates the normalized data and calculates accurate mass numbers
via standard calibration procedures and another regression technique.
Background effects are eliminated by taking a scan of the background before
admitting the mixture into the spectrometer via a variable leak valve.
Background and mixture data are then normalized by a factor proportional to
the total number of ions collected (the total pressure). This normalization
factor is effectively the sum of the areas under the peaks. An approximate
integration scheme is used, based on constant resolution over the usable mass
range and rectangular peaks. The data are now proportional to sensitivity
coefficients (divisions per unit pressure), and background effects can be
subtracted from the mixture gas independent of the orifice size on the
variable leak.
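
The area normalization can be sketched as below, assuming every peak is a
rectangle of the same width so that its area is simply height times width.
The function name, the dictionary layout, and the peak heights are
illustrative assumptions, and the background subtraction step is not shown.

```python
def normalize_by_area(scan, peak_width=1.0):
    """Divide each peak height by the summed area of all peaks.

    scan: {mass: peak_height}. Every peak is treated as a rectangle of the
    same width (constant resolution), so area = height * peak_width.
    """
    total_area = sum(height * peak_width for height in scan.values())
    if total_area <= 0.0:
        raise ValueError("scan contains no signal")
    return {mass: height / total_area for mass, height in scan.items()}


# Invented peak heights in chart divisions.
scan = {16: 40.0, 20: 10.0, 44: 50.0}
normalized = normalize_by_area(scan)
```

After this step the areas of the normalized peaks sum to one, so scans taken
at different total pressures are on a common scale.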


Experimental Results 


With a library of 123 compounds, a methane, CO2, argon, neon mixture
presented an excellent example of the analysis (Table 7). All of these
compounds were calibrated as pure samples on the CEC 612 and included in the
library data. The mass range is 12 to 44 and includes 9 peaks. These four
samples have a good degree of overlapping spectra, which tends to make
the analysis more difficult. The qualitative analysis is correct and the degree
of agreement in the quantitative analysis is very good for a CEC 612 RGA. The
known mole percentages are given only approximately because the pressure
values were read on an ion gauge calibrated only for nitrogen.
With 12 peaks ranging from 12 to 45, the methane, nitrogen, carbon
dioxide, neon, oxygen, argon mixture presents an abundance of overlapping
spectra. This mixture of six compounds (Table 8) represents a very difficult
case to interpret. The qualitative analysis is correct, choosing the proper
six from a library of 150 compounds. All of the compounds in this mixture
except oxygen are represented by in-house calibration data. For such a
complex mixture, the quantitative analysis is very encouraging.

The normal butane, isobutane, 2-cis butane mixture (Table 9) represents
the most difficult experimental test case successfully analyzed to date.
Normal butane and isobutane have very similar pattern coefficients with common
base peaks at 43 amu. The degree of overlapping spectra and the abundance of
compounds in this region tend to make the analysis very difficult. There are
18 peaks over a mass range of 13 to 58. Each of these spectra was individually
calibrated on the CEC 612. With a library of about 150 compounds the analysis
took about 90 seconds. The degree of agreement seems very favorable.

Additional experimental test mixtures are included in Appendix A.


FUTURE EFFORT 


The opportunity to gather an abundance of information from the residual 
gas analyzer has recently been made available. A seven-channel analog tape 
with appropriate operational amplifiers on each channel is being prepared to 
read various signals from the spectrometer. A DDP-116 will convert this
information to a digital signal. This future effort is aimed at improved
quantitative analysis by having more reliable and accurate data. Applying
some wave-shape considerations coupled with the instrument resolution
(polynomial regression and multiparameter optimization techniques) should
allow for accurate location of spectral position and intensity.

The multichannel information should also allow for normalization of
temperature variations, pressure excursions, amplifier drift, control voltage,
and current instabilities. This normalization should improve the stability of
the data.

At the heart of the regression calculations, a symmetric coefficient 
matrix is effectively being inverted. A number of techniques exist for solutions 
of simultaneous equations that result in better accuracy than Gauss-Jordan. 
Some of these techniques are presently being pursued and more complex test 
cases are being attempted. 
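
The report does not name the techniques under study. One standard
alternative for a symmetric positive definite coefficient matrix, shown here
only as an example of the kind of method that could be used, is Cholesky
factorization followed by forward and back substitution. The function name
and the small test system are illustrative.

```python
import math


def cholesky_solve(a, b):
    """Solve a x = b for symmetric positive definite a, using a = L L^T."""
    n = len(b)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                if s <= 0.0:
                    raise ValueError("matrix is not positive definite")
                low[i][j] = math.sqrt(s)
            else:
                low[i][j] = s / low[j][j]
    # Forward substitution: L z = b.
    z = [0.0] * n
    for i in range(n):
        z[i] = (b[i] - sum(low[i][k] * z[k] for k in range(i))) / low[i][i]
    # Back substitution: L^T x = z.
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(low[k][i] * x[k] for k in range(i + 1, n))
        x[i] = (z[i] - tail) / low[i][i]
    return x


# Small symmetric positive definite system with solution x = 1.75, y = 1.5.
solution = cholesky_solve([[4.0, 2.0], [2.0, 3.0]], [10.0, 8.0])
```

The factorization exploits the symmetry of the matrix and needs no pivoting
when the matrix is positive definite, which is the case for normal equations
built from linearly independent library spectra.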
CONCLUSION


In conclusion, this effort, applied to T-030, provides the ability to
interpret the data sent back from the quadrupole mass spectrometer. This
interpretation of the data should aid in understanding the ambient atmosphere
of the spacecraft and increase the probability of detecting detrimental
contaminants.

There also appears to be an excellent opportunity to apply this technique 
as an aid to the experimentalist concerned with the ambient atmosphere sur- 
rounding optical contamination experiments. 





MASS SPECTRAL DATA FOR N-BUTANE 
TABLE 3. SIMILARITIES IN PATTERN COEFFICIENTS N-BUTANE

 M/E        I       II      III       IV        V

  15       --       --     5.00     2.00       --
  26       --     2.50     6.00       --     4.00
  27    24.00    22.50    37.00    31.00    28.10
  28    27.50    24.50    32.50    29.50    27.10
  29    37.50    33.50    44.00    41.00    38.00
  39    12.00     9.00    12.50    11.00    10.50
  41    33.00    20.00    27.50    27.60    27.50
  42    16.00    13.00    12.00    11.00    11.50
  43   100.00   100.00   100.00   100.00   100.00
  44     3.50     3.50       --       --     3.00
  52       --       --       --     2.50       --
  57     3.50       --       --       --       --
  58    27.00    16.50    12.00    20.10    11.00

PATTERN COEFFICIENTS AS MEASURED BY
FIVE DIFFERENT LABORATORIES.






TABLE 4. SYNTHETIC SAMPLE TEST MIXTURE 5

TABLE 5. SYNTHETIC SAMPLE TEST MIXTURE 8

TABLE 6. SYNTHETIC SAMPLE TEST MIXTURE 5

TABLE 7. EXPERIMENTAL SAMPLE - CO2/METHANE/ARGON/NEON

TABLE 8. EXPERIMENTAL SAMPLE - METHANE/N2/CO2/Ne/O2/ARGON

TABLE A-1. EXPERIMENTAL SAMPLE - N-BUTANE/ISOBUTANE

BASE PEAK = 0.2186E 01

45.0 1.04 0.74 0.30 45.0

RMS RESIDUALS = 0.795E 00

RMS RESIDUALS = 0.524E 01

TABLE A-4. EXPERIMENTAL SAMPLE - AIR

BASE PEAK = 0.1870E 02

EXPERIMENTAL SAMPLE - METHANE/NEON

BASE PEAK = 0.3646E 01

EXPERIMENTAL SAMPLE - NEON/CO2

RMS RESIDUALS = 0.291E 00

32.0 0.14 0.00 3.19 32.0

40.0 0.98 0.00 21.11 40.0

44.00 100.00 99.85 3.06 44.0

BASE PEAK = 0.4661E 01

TABLE A-8. EXPERIMENTAL SAMPLE - TEST

RESIDUALS* 4.90% HELIUM, 3.40% HYDROGEN.

22.0 12.04 10.89 51.23 22.0

40.0 17.45 17.46 -0.51 40.0

44.0 71.50 71.32 7.99 44.0

BASE PEAK = 0.2238E 01

TABLE A-10. EXPERIMENTAL SAMPLE - CO2/N-BUTANE/ARGON

BASE PEAK = 0.3489E 01

TABLE A-11. EXPERIMENTAL SAMPLE - AMMONIUM HYDROXIDE

40.0 404 494 0.00 40.0

BASE PEAK = 0.9500E 01